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ENCODED COMBINATORIAL CHEMICAL LIBRARIES 
Description 

Technical Field 

5 The present invention relates to encoded chemical 

libraries that contain repertoires of chemical 
structures defining a diversity of biological 
structures, and methods for using the libraries. 

10 Background 

There is an increasing need to find new molecules 
which can effectively modulate a wide range of 
biological processes, for applications in medicine and 
agriculture. A standard way for searching for novel 
15 bioactive chemicals is to screen collections of 

natural materials, such as fermentation broths or 
plant extracts, or libraries of synthesized molecules 
using assays which can range in complexity from simple 
binding reactions to elaborate physiological 
20 _ preparations. The screens often only provide leads 
which then require further improvement either by 
empirical methods or by chemical design. The process 
it time-consuming and costly but it is unlikely to be 
totally replaced by rational methods even when they 
are based on detailed knowledge of the chemical 
structure of the target molecules. Thus, what we 
might call "irrational drug design" - the process of 
selecting the right molecules from large ensembles or 
repertoires - requires continual improvement both in 
the generation of repertoires and in the methods of 
selection. 

Recently there have been several developments in 
using peptides or nucleotides to provide libraries of 
compounds for lead discovery. The methods were 
originally developed to speed up the determination of 



25 



30 



35 
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reiterated in this fashion until the active 
hexapeptide is defined. This is analogous to the 
method used in searching a dictionary; the peptide is 
decoded by construction using a series of sieves or 
5 buckets and this makes the search logarithmic. 

A very powerful biological method has recently 
been described in which the library of peptides is 
presented on the surface of a bacteriophage such that 
each phage has an individual peptide and contains the 

10 DNA seguence specifying it. The library is made by 
synthesizing a repertoire of random oligonucleotides 
to generate all combinations, followed by their 
insertion into a phage vector. Each of the sequences 
is cloned in one phage and the relevant peptide can be 

15 selected by finding those that bind to the particular 

target. The phages recovered in this way can be 
amplified and the selection repeated. The sequence of 
the peptide is decoded by sequencing the DNA. See for 
example Cwirla et al. , Proc . Natl . Acad . Sci . USA , 

20 87:6378-6382 (1990); Scott et al., Science, 249:386- 

390 (1990); and Devlin et al., Science . 249:404-406 
(1990). 

Another "genetic" method has been described where 
the libraries are the synthetic oligonucleotides 

25 themselves wherein active oligonucleotide molecules 
are selected by binding to an acceptor and are then 
amplified by the polymerase chain reaction (PCR) . PCR 
allows serial enrichment and the structure of the 
active molecules is then decoded by DNA sequencing on 

30 clones generated from the PCR products. The 

repertoire is limited to nucleotides and the natural 
pyrimidine and purine bases or those modifications 
that preserve specific Watson-Crick pairing and can be 
copied by polymerases. 

35 The main advantages of the genetic methods reside 
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molecule of interest or by identifying a species in 
the library having a desired activity, including 
binding, activation, chemical catalysis and the like. 
Thereafter, the identity of the active molecule is 
5 determined by reading the genetic tag, i.e., the 
identifier oligonucleotide sequence. In one 
embodiment, amplified copies of their retrogenetic 
tags can be obtained by the polymerase chain reaction. 
The strands of the amplified copies with the 

10 appropriate polarity can then be used to enrich for a 
subset of the library by hybridization with the 
matching tags and the process can then be repeated on 
this subset. Thus serial enrichment is achieved by a 
process of purification exploiting linkage to a 

15 nucleotide sequence which can be amplified. Finally, 

the structure of the chemical entities are decoded by 
cloning and sequencing the products of the PCR 
reaction. 

The present invention therefore provides a novel 

20 method for identifying a chemical structure having a 
preselected binding or catalysis activity through the 
use of a library of bifunctional molecules that 
provides a rich source of chemical diversity. The 
library is used to identify chemical structures 

25 (structural motifs) that interact with preselected 

biological molecules. 

Thus, in one embodiment, the invention 
contemplates a bifunctional molecule according to the 
formula A-B-C, where A is a chemical moiety, B is a 

30 linker molecule operatively linked to A and C, and C 

is an identifier oligonucleotide comprising a sequence 
of nucleotides that identifies the structure of 
chemical moiety A. 

In another embodiment, the invention contemplates 

35 a library comprising a plurality of species of 
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aliquots of the composition to produce aliquots that 
contain a product containing a bifunctional molecule; 

d) combining the aliquots produced in step 
(c) to form an admixture of bifunctional molecules, 
5 thereby forming said library. 

In a related embodiment, the invention describes 
a bifunctional solid support for synthesizing an 
oligopeptide/ oligonucleotide conjugate comprising: 
(1) a solid support, the solid support being of a type 

10 which is dispersible in aqueous solution, (2) a first 
linkage unit coupled to the solid support, (3) a 
second linkage unit coupled to the first linkage unit, 
and (4) a bifunctional unit coupled to the second 
linkage unit, wherein the bifunctional unit having a 

15 first leaving group employable for oligopeptide 

synthesis and a second leaving group employable for 
oligonucleotide synthesis, wherein the first leaving 
group is N-FMOC or its functional equivalent, the 
second leaving group is O-DMT or its functional 

20 equivalent, wherein the second linkage unit is coupled 
to the first linkage unit by means of a bond cleavable 
by exposure to concentrated aqueous ammonia, and 
wherein the solid support, the first linkage unit, the 
second linkage unit, the cleavable bond, and the 

25 bifunctional unit, exclusive of the first and second 
leaving groups, each being substantially chemically 
unreactive to conditions employed by oligopeptide 
synthetic protocols using FMOC leaving groups and 
conditions employed by oligonucleotide synthetic 

30 protocols using O-DMT leaving groups. 

Further contemplated is an element of a library 
of oligopeptide/ oligonucleotide conjugates comprising: 
(1) a solid support being of a type which is 
dispersible in aqueous solution, (2) a first linkage 

35 unit coupled to the solid support, (3) a second 
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elements of a polymer comprised of a series of 
chemical building blocks to form a chemical moiety in 
the library, and a code for identifying the structure 
of the chemical moiety - 
5 Thus, a bifunctional molecule can be represented 

by the formula A-B-C, where A is a chemical moiety, B 
is a linker molecule operatively linked to A and C, 
and C is an identifier oligonucleotide comprising a 
sequence of nucleotides that identifies the structure 
10 of chemical moiety A. 

a. Chemical Polymers 

A chemical moiety in a 
bifunctional molecule of this invention is represented 

15 by A in the above formula A-B-C and is a polymer 
comprising a linear series of chemical units 
represented by the formula (X n ) a , wherein X is a 
single chemical unit in polymer A and n is a position 
identifier for X in polymer A. n has the value of 1+i 

20 where i is an integer from 0 to 10, such that when n 
is 1, X is located most proximal to the linker (B) . 

Although the length of the polymer can vary, 
defined by a, practical library size limitations arise 
if there is a large alphabet size as discussed further 

25 herein. Typically, a is an integer from 4 to 50. 

A chemical moiety (polymer A) can be any of a 
variety of polymeric structures, depending on the 
choice of classes of chemical diversity to be 
represented in a library of this invention. Polymer A 

30 can be any monomeric chemical unit that can be coupled 

and extended in polymeric form. For example, polymer 
A can be a polypeptide, oligosaccharide, glycolipid, 
lipid, proteoglycan, glycopeptide, sulfonamide, 
nucleoprotein, conjugated peptide (i.e., having 

35 prosthetic groups) , polymer containing enzyme 
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15 



F 


Phe 


Dhenvlalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 




W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


C 


Cys 


cysteine 



20 The phrase "amino acid residue" is broadly 

defined to include the amino acids listed in the Table 
of Correspondence and modified and unusual amino 
acids, such as those listed in 37 C.F.R. §1.822 (b) (4) , 
and incorporated herein by reference. 

25 The polymer defined by chemical moiety A can 

therefore contain any polymer backbone modifications 
that provide increased chemical diversity. In 
building of a polypeptide system as exemplary, a 
variety of modifications are contemplated, including 

30 the following backbone structures: -NHN(R)CO-, 

-NHB(R)CO-, -NHC(RR' )CO-, -NHC (=CHR) CO- , -NHC 6 H 4 CO-, 

* -NHCH 2 CHRCO- , -NHCHRCH 2 CO- , and lactam structures. 

In addition, amide bond modifications are 

* contemplated including -COCH 2 ~, -COS-, -CONR, -COO-, 
35 -CSNH-, -CH 2 NH-, -CH 2 CH 2 -, -CILjS-, -CH 2 SO-, -CH 2 S0 2 -, 
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Insofar as adenosine (A) , guanosine (G) , 
thymidine (T) and cytidine (C) represent the typical 
choices of deoxynucleotides for inclusion in a unit 
identifier oligonucleotide, A f G, T and C form a 
5 representative "alphabet" used to "spell" out a unit 

identifier oligonucleotide's sequence. Other 
nucleotides or nucleotide analogs can be utilized in 
addition to or in place of the above four nucleotides, 
so long as they have the ability to form Watson-Crick 
10 pairs and be replicated by DNA polymerases in a PCR 

reaction. However, the nucleotides A, G, T and C are 
preferred. 

For the design of the code in the identifier 
oligonucleotide, it is essential to chose a coding 
15 representation such that no significant part of the 

oligonucleotide sequence can occur in another 
unrelated combination by chance or otherwise during 
the manipulations of a bifunctional molecule in the 
library. 

20 For example, consider a library where Z is a 

trinucleotide whose sequence defines a unique chemical 
unit X. Because the methods of this invention- provide 
for all combinations and permutations of an alphabet 
of chemical units, it is possible for two different 

25 unit identifier oligonucleotide sequences to have 
closely related sequences that differ by only a 
frame shift and therefore are not easily 
distinguishable by hybridization or sequencing unless 
the frame is clear. 

30 Other sources of misreading of a unit identifier 

oligonucleotide can arise. For example, mismatch in 
DNA hybridization, transcription errors during a 
primer extension reaction to amplify or sequence the 
identifier oligonucleotide, and the like errors can 

35 occur during a manipulation of a bifunctional 
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discussed in more detail herein. 

In a preferred embodiment, an identifier 
oligonucleotide C has a nucleotide sequence according 
to the formula Pl-(Z n ) a -P2, where PI and P2 are 
5 nucleotide sequences that provide polymerase chain 

reaction (PCR) primer binding sites adapted to amplify 
the polymer identifier oligonucleotide. The 
requirements for PCR primer binding sites are 
generally well known in the art, but are designed to 

10 allow a PCR amplification product (a PCR-amplif ied 
duplex DNA fragment) to be formed that contains the 
polymer identifier oligonucleotide sequences. 

The presence of the two PCR primer binding sites, 
PI and P2, flanking the identifier oligonucleotide 

15 sequence (Z n ) a provides a means to produce a PCR- 

amplif ied duplex DNA fragment derived from the 
bifunctional molecule using PCR. This design is 
useful to allow the amplification of the tag sequence 
present on a particular bifunctional molecule for 

20 cloning and sequencing purposes in the process of 

reading the identifier code to determine the structure 
of the chemical moiety in the bifunctional molecule. 

More preferred is a bifunctional molecule where 
one or both of the nucleotide sequences PI and P2 are 

25 designed to contain a means for removing the PCR 

primer binding sites from the identifier 
oligonucleotide sequences. Removal of the flanking PI 
and P2 sequences is desirable so that their sequences 
do not contribute to a subsequent hybridization 

30 reaction. Preferred means for removing the PCR primer 
binding sites from a PCR amplification product is in 
the form of a restriction endonuclease site within the 
PCR-amplif ied duplex DNA fragment. 

Restriction endonucleases are well known in the 

35 art and are enzymes that recognize specific lengths of 
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termini. Step 2 illustrates the specific 
biotinylation of the anticoding strand at the Sty I 
site, whereby the incorporation of biotinylated UTP is 
indicated by a B. 
5 The presence of non-overlapping cohesive termini 

after Step 1 in Figure 1 allows the specific and 
directional cloning of the restriction-digested PCR- 
amplified fragment into an appropriate vector, such as 
a sequencing vector. In addition, the Sty I was 

10 designed into Pi because the resulting overhang is a 
substrate for a f illing-in reaction with dCTP and 
biotinyl-dUTP (BTP) using DNA polymerase Klenow 
fragment. The other restriction site, Apa I, was 
selected to not provide substrate for the above 

15 biotinylation, so that only the anticoding strand can 
be biotinylated. 

Once biotinylated, the duplex fragment can be 
bound to immobilized avidin and the duplex can be 
denatured to release the coding sequence containing 

20 the identifier nucleotide sequence, thereby providing 
purified anticoding strand that is useful as a 
hybridization reagent for selection of related boding 
strands as described further herein. 



25 c. Linker Molecules 

A linker molecule in a 
bifunctional molecule of this invention is represented 
by B in the above formula A-B-C and can be any 
molecule that performs the function of operatively 

30 linking the chemical moiety to the identifier 
oligonucleotide . 

Preferably, a linker molecule has a means for 
attaching to a solid support, thereby facilitating 
synthesis of the bifunctional molecule in the solid 

35 phase. In addition, attachment to a solid support 
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of a 2-hydroxymethyl benzoic acid moiety provides a 
site cleavable by the combination of alkaline 
phosphatase treatment followed by mild alkaline 
treatment. Thus, the incorporation of selectively 
5 cleavable linkers other than those recited above are 

also considered a part of the invention. 

Solid supports for chemical synthesis are 
generally well known. Particularly preferred are the 
synthetic resins and controlled pore glass (CPG) 

10 supports used in oligonucleotide and in polypeptide 

synthesis that are available from a variety of 
commercial sources including Glen Research (Herndon, 
VA) , Bachem Biosciences (Philadelphia, PA) , Sigma 
Chemical Co. (St. Louis, MO) , CPG Inc., (Fairfield, 

15 NJ) and Applied Biosystems (Foster City, CA) . Most 

preferred are teflon and CPG supports such as are 
described in Example 2. 

In a related embodiment, the invention describes 
a preferrred bifunctional solid support particularly 

20 suited for producing a bifunctional molecule of this 

invention having a polypeptide as the chemical moiety 
(i.e., an oligopeptide/ oligonucleotide conjugate). 

A preferred bifunctional solid support for 
synthesizing oligopeptide/oligonucleotide conjugates 

25 comprises a solid support, a first linkage unit, a 
second linkage unit, and a bifunctional unit. A 
"bifunctional unit" in this context is not to be 
confused with a bifunctional molecule of this 
invention, and refers instead to that chemical moiety 

30 present on a bifunctional solid support which provides 
the two (bi-) reactive functionalities, one for 
coupling the peptide and another for coupling the 
oligonucleotide. An exemplary bifunctional unit is 
shown in Figure 3 as the serine-branch monomer 

35 following the aminohexanol linker. 
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and the bifunctional unit, exclusive of said first and 
second leaving groups, are each substantially 
chemically unreactive under conditions employed during 
conventional oligopeptide synthesis protocols using 
5 FMOC leaving groups [see: Bodanszky et al., in The 
practice of Peptide Synthesis, Springer-Verlag, 
(1984); and Bodanszky et al., in Principles of 
Peptide Synthesis, Springer-Verlag, (1984)] and under 
conditions employed during conventional 

10 oligonucleotide synthesis protocols using O-DMT 
leaving groups and phosphoramidite donors. 

In an alternative embodiment, the bifunctional 
solid support also includes a third linkage unit. The 
third linkage unit is interposed between and coupled 

15 to the bifunctional unit and the first leaving group. 

In a preferred mode of this embodiment, the third 
linkage unit is photosensitive, i.e. it is cleavable 
by exposure to ultra-violet light. The third linkage 
unit may include a 3-nitro-4-bromomethyl benzoate 

20 group coupled by means of an amide bond to the amino 

end of the bifunctional serine residue and coupled by 
means of an ester bond to an FMOC blocked amino acid. 



— 2. Libraries 

25 A library of this invention is a 

repertoire of chemical diversity comprising a 
plurality of species of bifunctional molecules 
according to the present invention. The plurality of 
species in a library defines a family of chemical 

30 diversity whose species each have a different chemical 

moiety. Thus the library can define a family of 
peptides, lipids, oligosaccarides or any of the other 
classes of chemical polymers recited previously. 
~ The number of different species in a library 

35 represents the complexity of a library and is defined 
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molecule to be assayed and a bifunctional molecule. 

If, for example, the binding threshold is lO* 6 *! 
(micromolar) , then there must be at least one nanomole 
of each species in a library of 1 milliliter (ml) 
5 volume. At this threshold, a library having a 

complexity of 10* could contain 10 micromoles of each 
species. Because of the reciprocal relationship 
between library complexity and binding threshold, more 
complex libraries are possible where the binding 

10 threshold is lower. 

The relative amounts of the individual 
bifunctional molecule species within the library can 
vary from about 0.2 equivalents to about 10 
equivalents, where an equivalent represents the 

15 average amount of a species within the library. 

Preferably each species is present in the library in 
approximately equimolar amounts. 

In a preferred embodiment, a library contains the 
complete repertoire of chemical diversity possible 

20 based on the mathematical combinations for a given 
library where there is a fixed alphabet and a 
preselected number of chemical units in all species of 
the library. Thus a complete repertoire is one that 
provides a source of all the possible chemical 

25 diversity that can be found in a library of this 

invention having a fixed alphabet and chemical length. 

It is particularly preferred that a library be 
comprised of bifunctional molecules where each species 
of bifunctional molecule contains the same nucleotide 

30 sequence for either the PI or P2 PCR primer binding 

sites. A library with this design is particularly 
preferred because, when practicing the methods of this 
invention, a single PCR primer pair can be used to 
amplify any species of identifier oligonucleotide 

35 (coding sequence) present in the library. 
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attached to said bifunctional unit, and an 
oligonucleotide attached to said bifunctional unit. 
An alternative to this embodiment includes a cleavable 
bond for coupling the bifunctional unit to the 
5 oligopeptide. The cleavable bond may be of a type 

which is photosensitive, i.e. cleavable by exposure to 
ultraviolet light, as indicated above. 

B. Methods for Producing a Library 
10 The present method for producing a plurality 

of bifunctional molecules to form a library of this 
invention solves a variety of problems regarding 
efficient synthesis of large numbers of different 
species. 

15 In the present synthesis methods, the sequential 

steps of first adding a chemical unit X followed by 
the addition of an oligonucleotide sequence to the 
linker molecule requires an alternating parallel 
synthesis procedure to add chemical unit X and then 

20^ add a unit identifier nucleotide sequence Z that 

defines (codes for) that corresponding chemical unit. 
The library is built up by the repetition of this 
alternating parallel process after pooling and 
division of the reaction products as described herein. 

25 The only constraint for making an encoded library 

is that there must be compatible chemistries between 
the two alternating syntheses procedures for adding a 
chemical unit as compared to that for adding a 
nucleotide or oligonucleotide sequence. 

30 The problem of synthesis compatibility is solved 

by the correct choice of compatible protecting groups 
as the alternating polymers are synthesized, and by 
the correct choice of methods for deprotection of one 
growing polymer selectively while the other growing 

35 polymer remains blocked, such as by the use of 
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identifier oligonucleotide Z first to termini C 1 . A 
first cycle involves the steps of deprotecting the 
termini of the linker to which a building block is to 
be added and then adding the building block to the 
5 termini. Typically, the added building block contains 
a blocking group at its free termini, i.e., the 
termini that will participate in an addition of the 
next building block of its type. The linker molecule 
is then subjected to a second cycle of synthesis to 

10 add a building block at the other (second) termini. A 
second cycle involves the steps of deprotecting the 
second termini of the linker to which a building block 
is to be added and then adding the building block to 
the termini. Again, the added building block is 

15 typically blocked at its free termini. 

The addition of identifier oligonucleotide Z to 
termini C can be conducted either nucleotide by. 
nucleotide to form the complete unit identifier 
nucleotide sequence Z, or Z can be presynthesized, and 

20 the oligonucleotide Z added as a block to termini C». 

Insofar as the synthesis of oligonucleotides is well 
known in the arts, the presynthesis of 
oligonucleotides, and their addition to the growing 
nucleotide polymer in blocks is preferred because it 

25 reduces the number of manipulations in synthesizing a 
bifunctional molecule. 

A chemical unit X or a unit identifier 
oligonucleotide Z is referred to as a precursor (X 1 or 
Z 1 ) to indicate that it contains a leaving group 

30 compatible with the reaction chemistry that 

facilitates the precursor's operative linkage to the 
growing polymer at the appropriate termini. 

The product resulting from step (2) is a 
bifunctional molecule having the structure A'-Xj-B-Zj- 

35 C 1 , and is ready for a repetition of the above first 
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Thus the steps of (i) dividing a linker or pool 
into aliquots, (ii) parallel addition of X and Z to 
the linker substrate in separate aliquots, and (iii) 
pooling of the aliquots, can be cycled (repeated) to 
5 sequentially add the chemical units and their 

corresponding unit identifier oligonucleotides to form 
the library comprising a plurality of bifunctional 
molecules each having a different chemical polymer 
operative ly linked through the linker to a 

10 corresponding identifier oligonucleotide. 

In a preferred embodiment, a method for forming a 
library of this invention includes the steps for 
addition of the PCR primer binding sites PI and P2 to 
each of the bifunctional molecules in the library. 

15 The method is substantially the same as above, 

but includes the addition of a series of nucleotides 
or a presynthesized PI oligonucleotide to the linker 
molecule provided in step (1) prior to the cycles of 
step (2) that add X and Z. Because all members of the 

20 library are to contain the same PI sequence, PI is 

added to the C termini of linker molecule A'-B-C 
prior to dividing the linker into aliquots and • 
subjecting the aliquots to the cycles of step (2) 
adding X 1 and Z r The resulting product has the 

25 formula A'-B-Pl-C". 

Thereafter, the product is aliquoted and cycled 
as before, resulting in the preparation of the product 
A'-fX^-B-Pl-CZ^-C 1 , where a indicates the presence 
of a polymer of length "a". 

30 Next, the pooled admixture containing product 

A'-CXJg-B-Pl-fZ^-C is subjected to the addition of 
a series of nucleotides or a presynthesized 
oligonucleotide P2 at termini C» to form the product 
A , -(X n ) a -B-Pl-(Z n ) a -P2-C l . Thus all members of the 

35 library contain a common sequence PI and a common 
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to any particular blocking group. Also indicated in 
Table 1 are preferred blocking groups. 

TABLE 1 

5 



10 



20 



25 



Amino Acid 


Blockina Group 


Arginine 


N-MTr 1 , N-PMC 7 


Histidine 


N*-Bum 2 , SEM 10 , FMOC, DNP 


Cysteine 


S-Trt 3 , Acm 1 1 , S-t-buty 1 


Tryptophan 


N'-CHO, none 


Tyrosine 


0-TBS A 


Aspartic acid 


O-TSE 5 , DMB 12 


Glutamic acid 


O-TSE 5 , DMB 


Serine 


O-TBS 4 


Threonine 


O-TBS* 


Lysine 


N-Bz 6 . TFA 8 TEOC 9 


Asparagine 


none 


Glutamine 


none 


Glycine 


none 


Phenylalanine 


none 


Methionine 


none 


Alanine 


none 


Isoleucine 


none 


Leucine 


none 


Valine 


none 


Proline 


none 



MTr is N 9 -4-methoxy-2, 3 , 6-trimethylbenzene 
sulfonyl. 

Bum is tert-butoxymethyl . 

Trt is triphenylmethyl. 

TSE is trimethylsylilethylester. 

TBS is tert-butyl-dimethylsilylester. 

Bz is benzyl. 

PMC is N G -2, 2,5,7, 8-pentamethylchromon-6- 
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yl) -1, 1, 3 , 3-tetramethyluronium hexaf luorophosphate. 
TBTU is 2-(lH-benzotriazole-l-yl)-l,l,3,3- 
tetramethyluronium tetraf luoroborate. pyBOP is 
bensotriazol-l-yloxytris[dimethylamino phosphonium 
5 hexaf luorophosphate. The addition reaction requires 

the blocked amino acid, dimethylf ormamide (DMF) and 
hydroxy-benzotriazole (HOBt) as is well known for 
peptide synthesis. The resulting product contains an 
added amino acid residue with a FMOC-blocked amino 

10 terminus, ready for deblocking addition of a 
subsequent blocked amino acid as before . 

For synthesis of a polypeptide on the linker 
substrate in the direction of amino to carboxy 
terminus, a free carboxy terminus on the linker is 

15 required that can be conveniently blocked and 

deblocked as needed. A preferred carboxy terminus 
blocking/ activating group is the Opfp ester described 
before, A carboxy terminus on the linker is produced 
by reacting a linker with a free amino terminus with 

20 succinamide in HOBT and a proton catalyst. 

Thereafter, the terminus can be modified by reaction 
with pentaf luorophenol in dicyclohexylcarbodiiniide 
(DCC) and ethanol acetate to form an Opfp ester at the 
— free carboxy terminus. The Opfp ester is blocked 

25 linker terminus is available for addition reaction 

with a FMOO, Opfp- blocked amino acid as before, but 
with the amino acid adding to the linker in the 
reverse direction. The resulting product contains an 
added amino acid residue with an Opfp-blocked 

30 terminus, ready to repeat the addition with a 
subsequent blocked amino acid. 



35 



b. Oligonucleotide Synthesis 
Oligonucleotides can be 
synthesized by a variety of chemistries as is well 
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ready for deblocking and addition of a subsequent 
blocked nucleotide as before. 

For synthesis of an oligonucleotide on the linker 
in the direction of 5' to 3' ( a free hydroxy terminus 

5 on the linker is required as before. However, the 
blocked nucleotide to be added has the blocking 
chemistries reversed on its 5 1 and 3' termini to 
facilitate addition in the opposite orientation. 

A nucleotide with a free 3' hydroxy 1 and 5' DMT 
10 ether is first blocked at the 3 1 hydroxy terminus by 
reaction with TBS-Cl in imidazole to form a TBS ester 
at the 3' terminus. Then the DMT- blocked 5' terminus 
is deblocked with DCA in DCM as before to form a free 
5' hydroxy terminus. The reagent (N,N- 

15 diisopropylamino) (cyanoethyl) phosphonamidic chloride 
is reacted in tetrahydrofuran (THF) with the 5' 
deblocked nucleotide to form the aminodiisopropyl-, 
CNE- blocked phosphoramidite group on the 5 1 terminus. 
Thereafter the 3' TBS ester is removed with 

20 tetrabutylammonium fluoride (TBAF) in DCM to form a 
nucleotide with the phosphoramidite-blocked 5' 
terminus and a free 3 1 hydroxy terminus. Reaction in 
base with DMT-C1 adds a DMT ether blocking group to 
the 3' hydroxy terminus. 

25 The addition of the 3' DMT-, 5' CNE- blocked 

phosphoramidited nucleotide to a linker substrate 
having a free hydroxy terminus then proceeds using the 
previous tetrazole catalyzed reaction, as is well 
known for oligonucleotide synthesis. The resulting 

30 product contains an added nucleotide residue with a 

DMT-blocked 3' terminus, ready for deblocking with DCA 
in DCM and the addition of a subsequent blocked 
nucleotide as before. _ 

The above demonstrates that the present 

35 bifunctional molecules can be synthesized having 
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In the present preferred embodiment for a 
polypeptide library, the following sequence of 
deprotecting is preferred when using the teflon 
support and 5'BCM3 linker described in Examples 2-3: 
5 1) tetrabutyl ammonium fluoride (TBAF) 

treatment to remove TBS and TMS ethyl ethers; 

2) brief 5 minute treatment with 

trif luoroacetic acid (TFA) treatment sufficient to 
remove MTr, Bum, PMC and Trt groups, followed by 
10 neutralization for 5 minutes with triethylamine; 

3) aqueous ammonia treatments to remove Bz and 
CNE groups; and 

4) cleavage of the bifunctional molecule from 
the solid support using a periodate oxidation. 

15 Alternatively, the following sequence of 

deprotection is used on the CPG linker described in 
Example 3 either with or without the photoactively 
cleavable linker moiety: 

1) TBAF treatment to remove TBS and TMS ethyl 
2 0 ethers ; and 

2) aqueous ammonia treatment to remove Bz and 
CNE groups. 

As indicated, after the library has been 
synthesized, and after the protecting groups have been 

25 removed, the bifunctional molecules may be cleaved off 

of the solid support, and the released bifunctional 
molecules separated from the solid phase to form a 
solution comprising a plurality of bifunctional 
molecules. Alternatively, the library may be 

30 maintained in the form of a plurality of bifunctional 
molecules in the solid phase. 

Although natural amino acids are used in the 
Examples, the present invention is not to be so 
limited. The alphabet of possible amino acid residues 

35 can be extended to include any molecule that satisfies 
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facilitates identification of the chemical structure. 

By the present screening methods, one can 
identify optimized chemical structures that 
participate in binding interactions or chemical 
5 catalysis events with a biologically active molecule 

by drawing upon a repertoire of structures randomly 
formed by the combinatorial association of diverse 
chemical units without the necessity of either 
synthesizing them one at a time or knowing their 

10 interactions in advance. 

The invention therefore also contemplates a 
method for identifying a chemical structure that 
participates in a preselected binding or catalysis 
interactions between the chemical structure and a 

15 biologically active molecule. The chemical structure 

to be identified is represented by one of the members 
of a library of this invention, and the method 
comprises the following steps: 

(1) A library according to the present invention 
20 " is admixed with a preselected biologically active 

molecule under binding conditions (i.e., a binding 
reaction admixture) for a time period sufficient for 
the biologically active molecule to interact with at 
least one bifunctional molecule of this invention 
25 present in the library and form a binding reaction 
complex. 

(2) The binding reaction complex is then 
isolated from the library admixture to form an 
isolated complex. 

30 (3) The nucleotide sequence of the polymer 

identifier oligonucleotide present in the isolated 
binding reaction complex is determined. The 
nucleotide sequence provides a code that defines the 
chemical structure that participated in the binding 

35 reaction, and thus determining that sequence 
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phase. Still further, both the library members and 
the biologically active molecule can be in the liquid 
phase . 

Binding conditions, are those conditions 
5 compatible with the known natural binding function of 
the biologically active molecule. Those compatible 
conditions are buffer, pH and temperature conditions 
that maintain the biological activity of the 
biologically active molecule, thereby maintaining the 
ability of the molecule to participate in its 
preselected binding interaction. Typically, those 
conditions include an aqueous, physiologic solution of 
pH and ionic strength normally associated with the 
biologically active molecule of interest. 

For example, where the binding interaction is to 
identify a member in the library able to bind an 
antibody molecule, the preferred binding conditions 
would be conditions suitable for the antibody to 
immunoreact with its immunogen, or a known 
20 immunoreacting antigen. For a receptor molecule, the 
binding conditions would be those compatible with 
measuring receptor-ligand interactions. 

A time period sufficient for the admixture to 
form a binding reaction complex is typically that 
length of time required for the biologically active 
molecule to interact with its normal binding partner 
under conditions compatible yith interaction. 
Although the time periods can vary depending on the 
molecule, admixing times are typically for at least a 
few minutes, and usually not longer than several 
hours, although nothing is to preclude using longer 
admixing times for a binding reaction complex to form. 

A binding reaction complex is a stable product of 
the interaction between a biologically active molecule 
35 and a bifunctional molecule of this invention. The 
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immunoreacts with the biologically active molecule 
present in the binding reaction admixture to form an 
antibody-biological ly active molecule immunoreaction 
complex. Thereafter, by separation of the solid phase 
5 from the binding reaction admixture, the 

immunoreaction complex, and therefor any binding 
reaction complex, is separated from the admixture to 
form isolated bifunctional molecule. 

Alternatively, a binding means can be operatively 

10 linked to the biologically active molecule to 

facilitate its retrieval from the binding reaction 
admixture. Exemplary binding means are one of the 
following high affinity pairs: biotin-avidin, protein 
A-Fc receptor, ferrit in-magnetic beads, and the like. 

15 Thus, the biologically active molecule is operatively 

linked (conjugated) to biotin, protein A, ferritin and 
the like binding means, and the binding reaction 
complex is isolated by the use of the corresponding 
binding partner in the solid phase, e.g., solid-phase 

20 avidin, solid-phase Fc receptor, solid phase magnetic 
beads and the like. 

The use of solid supports on which to operatively 
link proteinaceous molecules is generally well known 
in the art. Useful solid support matrices are well 

25 known in the art and include cross-linked dextran such 
as that available under the tradename SEPHADEX from 
Pharmacia Fine Chemicals (Piscataway, NJ) ; agarose, 
borosilicate, polystyrene or latex beads about 1 
micron to about 5 millimeters in diameter, polyvinyl 

30 chloride, polystyrene, cross- linked polyacrylamide, 
nitrocellulose or nylon-based webs such as sheets, 
strips, paddles, plates microtiter plate wells and the 
like insoluble matrices. 
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the amplified fragments into sequencing vectors. The 
cloning and sequencing of the amplified fragments then 
is a routine procedure that can be carried out by any 
of a number of molecular biological methods known in 
5 the art. 

Alternatively, PCR amplified products derived 
from a population of isolated bifunctional molecules 
can be used as a hybridization probe to selectively 
enrich the quality of the isolated bifunctional 

10 molecules. For example, using the hybridization 

probes, which are modified by biotinylation as shown 
in Figure 1, one can isolate members of the library by 
hybridization, to form an enriched library containing 
only bifunctional molecules that have sequences that 

15 hybridize to the above hybridization probes. In a 

second screening reaction under different binding 
conditions, for example, higher stringency binding 
conditions, one can isolate the species of 
bifunctional molecule that binds most tightly with the 

20 biologically active molecule. 

Thus the library can be manipulated to form 
enriched libraries from which to screen for chemical 
diversity. 

25 4. Polymerase Chain P^rHnn 

For determining the nucleotide sequence 
of the identifier oligonucleotide in the isolated 
complex as part of the methods of this invention, the 
use of the polymerase chain reaction (PCR) is a 

30 preferred embodiment. 

For use in this invention, the identifier 
oligonucleotide are comprised of polynucleotide coding 
strands, such as mRNA and/ or the sense strand of 
genomic DNA. If the genetic material to be assayed is 

35 in the form of double stranded DNA, it is usually 
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phasic with time periods of relative temperature 
stability at each of temperatures favoring 
polynucleotide synthesis , denaturation and 
hybridization. 

5 A plurality of first primer and/or a plurality of 

second primers can be used in each amplification, 
e.g., one species of first primer can be paired with a 
number of different second primers to form several 
different primer pairs. Alternatively, an individual 
10 pair of first and second primers can be used. In any 
case, the amplification products of amplifications 
using the same or different combinations of first and 
second primers can be combined for assaying for 
mutations . 

15 The PCR reaction is performed using any suitable 

method. Generally it occurs in a buffered aqueous 
solution, i.e., a PCR buffer, preferably at a pH of 
7-9, most preferably about 8. Preferably, a molar 
excess of the primer is admixed to the buffer 

20 containing the template strand. A large molar excess 
is preferred to improve the efficiency of the process. 

The PCR buffer also contains the 
deoxyribonucleotide triphosphates (polynucleotide 
synthesis substrates) dATP, dCTP, dGTP, and dTTP and a 

25 polymerase, typically thermostable, all in adequate 
amounts for primer extension (polynucleotide 
synthesis) reaction. The resulting solution (PCR 
admixture) is heated to about 90°C - 100°C for about 1 
to 10 minutes, preferably from 1 to 4 minutes. After 

30 this heating period the solution is allowed to cool to 
54 °C, which is preferable for primer hybridization. 
The synthesis reaction may occur at from room 
temperature up to a temperature above which the 
polymerase (inducing agent) no longer functions 

35 efficiently. Thus, for example, if DNA polymerase is 
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polymerase amplifies the starting polynucleotide as 
has been described by Chamberlin et al., The Enzvmes . 
ed. P. Boyer, pp. 87-108, Academic Press, New York 
(1982) . Amplification systems based on transcription 
5 have been described by Gingeras et al., in PCR 

Protocols, A Guide to Methods » n d AppI i nal-i one pp . 
245-252, Innis et al. , eds, Academic Press, Inc., San 
Diego, CA (1990). 

If the inducing agent is a DNA-dependent RNA 
10 polymerase and, therefore incorporates ribonucleotide 

triphosphates, sufficient amounts of ATP, CTP, GTP and 
UTP are admixed to the primer extension reaction 
admixture and the resulting solution is treated as 
described above. 

15 The new ly synthesized strand and its 

complementary nucleic acid strand form a 
double-stranded molecule which can be used in the 
succeeding steps of the process. 

PCR amplification methods are described in detail 

20 in U.S. Patent Nos. 4,683,192, 4,683,202, 4,800,159, 

and 4,965,188, and at least in several texts including 
PCR Technology: Principles and Anp H cations for ra» 
Amplification , H. Erlich, ed., Stockton Press, New 
York (1989) ; and PCR Proton s: A guide to Methods 
and Applications , innis et al., eds., Academic Press, 
San Diego, California (1990) . 

The term "polynucleotide" as used herein in 
reference to primers, probes and nucleic acid 
fragments or segments to be synthesized by primer 
extension is defined as a molecule comprised of two or 
more deoxyribonucleotides or ribonucleotides, 
preferably more than three. Its exact size will 
depend on many factors, which in turn depends on the 
ultimate conditions of use. 
35 The term "primer" as used herein refers to a 
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non-complementary nucleotide fragment can be attached 
to the 5' end of the primer, with the remainder of the 
primer sequence being substantially complementary to 
the strand. Such non-complementary fragments 
typically code for an endonuclease restriction site. 
Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, 
provided the primer sequence has sufficient 
complementarity with the sequence of the strand to be 
synthesized or amplified to non-randomly hybridize 
therewith and thereby form an extension product under 
polynucleotide synthesizing conditions. 

Primers of the present invention may also contain 
a DNA-dependent RNA polymerase promoter sequence or 
its complement. See for example, Krieg et al., Nucl. 
Acids Res., 12:7057-70 (1984); Studier et al. , J. Mol. 
Biol^, 189:113-130 (1986); and Molecular Cloning- * 
Laboratory Manua l. Second Edition . Maniatis et al., 
eds., Cold Spring Harbor, NY (1989). 

When a primer containing a DNA-dependent RNA 
polymerase promoter is used, the primer is hybridized 
to the polynucleotide strand to be amplified and the 
second polynucleotide strand of the DNA-dependent RNA 
polymerase promoter is completed using an inducing 
agent such as E. coli DNA polymerase I, or the Klenow 
fragment of E. coli DNA polymerase. The starting 
polynucleotide is amplified by alternating between the 
production of an RNA polynucleotide and DNA 
polynucleotide. 

Primers may also contain a template sequence or 
replication initiation site for a RNA-directed RNA 
polymerase. Typical RNA-directed RNA polymerase 
include the QB replicase described by Lizardi et ai., 
Biotechnology, 6:1197-1202 (1988). RNA-directed 
polymerases produce large numbers of RNA strands from 
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synthesis, i.e., initiate a primer extension reaction 
off its 3« terminus. One or both of the primers can 
additionally contain a 5 '-terminal (5 "-most) 
non-priming portion, i.e., a region that does not 
5 participate in hybridization to the preferred 
template. 

5 - Nucleic Acid Semit e Analysis 

Nucleic acid sequence analysis is a 
well known procedure for determining the sequence of 
nucleotides and is applied to the present methods to 
determine the nucleotide sequence in an identifier 
oligonucleotide or PCR amplification product of this 
invention. Nucleic acid sequence analysis is 
approached by a combination of (a) physiochemical 
techniques, based on the hybridization or denaturation 
of a probe strand plus its complementary target, and 
(b) enzymatic reactions with endonucleases, ligases/ 
and polymerases. 

In assays using nucleic acid hybridization, 
detecting the presence of a DNA duplex in a process of 
the present invention can be accomplished by a variety 
of means. 
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In one approach for detecting the presence of a 
DNA duplex, an oligonucleotide that is hybridized in 
the DNA duplex includes a label or indicating group 
that will render the duplex detectable. Typically 
such labels include radioactive atoms, chemically 
modified nucleotide bases, and the like. 

The oligonucleotide can be labeled, i.e., 
operatively linked to an indicating means or group, 
and used to detect the presence of a specific 
nucleotide sequence in a target template. 

Radioactive elements operatively linked to or 
present as part of an oligonucleotide probe (labeled 
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hybridized to a DNA duplex. 

Techniques for the separation of single stranded 
oligonucleotide, such as non-hybridized labeled 
oligonucleotide probe r from DNA duplex are well known, 
5 and typically involve the separation of single 

stranded from double stranded nucleic acids on the 
basis of their chemical properties. More often 
separation techniques involve the use of a 
heterogeneous hybridization format in which the 

10 non-hybridized probe is separated, typically by 
washing, from the DNA duplex that is bound to an 
insoluble matrix. Exemplary is the Southern blot 
technique, in which the matrix is a nitrocellulose 
sheet and the label is 32 P. Southern, J. Mol. Biol. . 

15 98:503 (1975). 

The oligonucleotides can also be advantageously 
linked, typically at or near their 5 • -terminus, "to a 
solid matrix, i.e., aqueous insoluble solid support as 
previously described. 

20 It is also possible to add "linking" nucleotides 

to the 5' or 3' end of the member oligonucleotide, and 
use the linking oligonucleotide to operatively link 
the member to the solid support. 

In nucleotide hybridizing assays, the 

25 hybridization reaction mixture is maintained in the 

contemplated method under hybridizing conditions for a 
time period sufficient for the oligonucleotides having 
complementarity to the predetermined sequence on the 
template to hybridize to complementary nucleic acid 

30 sequences present in the template to form a 

hybridization product, i.e., a complex containing 
oligonucleotide and target nucleic acid. 

The phrase "hybridizing conditions" and its 
grammatical equivalents, when used with a maintenance 

35 time period, indicates subjecting the hybridization 
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Where the nucleic acid containing a target 
sequence is in a double stranded (ds) form, it is 
preferred to first denature the dsDNA, as by heating 
or alkali treatment, prior to conducting the 
5 hybridization reaction. The denaturation of the dsDNA 
can be carried out prior to admixture with a 
oligonucleotide to be hybridized, or can be carried 
out after the admixture of the dsDNA with the 
oligonucleotide . 
10 Effective amounts of the oligonucleotide present 

in the hybridization reaction admixture are generally 
well known and are typically expressed in terms of 
molar ratios between the oligonucleotide to be 
hybridized and the template. Preferred ratios are 
15 hybridization reaction mixtures containing equimolar 
amounts of the target sequence and the , 
oligonucleotide. As is well known, deviations from 
equal molarity will produce hybridization reaction 
products, although at lower efficiency. Thus, 
although ratios where one component can be in as much 
as 100 fold molar excess relative to the other 
component, excesses of less than 50 fold, preferably 
less than 10 fold, and more preferably less than two 
fold are desirable in practicing the invention. 
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Examples 

The following examples are intended to 
illustrate, but not limit, the present invention. 



1 - Preparat ion of Protected Amino Acids 

The synthesis of a bifunctional molecule requires 
protected amino acids. The amino-terminus of the 
amino acid is protected with f luorenylmethoxycarbonyl 
(FM0C) and the car boxy- terminus is protected with a 
pentaf luorophenyl ester (Opfp) . The amino acids 
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atmosphere to form FMOC-Tyr (OTBS) -Opfp. 

FMOC-Ser(OTBS)-Opfp is similarly prepared using 
FMOC-Ser(tert-butyl)-Opfp (Bachem) in the reaction. 
FMOC-Thr (OTBS) -Opfp is also prepared in this manner 
5 using FMOC-Thr (tert-butyl) -Opfp (Bachem) . 

FMOC-Asp(TMSE)-Opfp having a trimethylsilyl ethyl 
ester (TMSE) on the side chain carboxyl group of 
aspartic acid is prepared by first reacting one 
equivalent of FMOC-Asp-O-tert butyl (Bachem) with 1.5 

10 equivalents of 2-trimethylsilylethanol and 1.5 

equivalents if dicyclocarbodiimide (DCC) in ethyl 
acetate for 12 hours at room temperature under inert 
atmosphere to form FMOC-Asp(OTMSE)-0-tertbutyl. There 
after the TMSE ester is reacted with an excess of 

15 formic acid at room temperature for 14 hours to 
hydrolyze the tertbutyl moiety and form a free 
carboxyl terminus in the form of FMOC-Asp(OTMSE) -COOH. 
The formic acid is evaporated, and the 1 equivalent of 
the remaining amino acid is admixed with 1.1 

20 equivalent of pentaf luorophenol (pfp; Bachem) 

and 1.1 equivalent of DCC for 12 hours at room 
temperature under inert atmosphere to form the product 
FMOC-Asp (TMSE) -Opfp. The product is isolated from 
unreacted pfp, DCC and precursor amino acid by silica 

25 gel chromatography using 10% (v/v) ethyl acetate in 
hexane . 

FMOC-Glu (TMSE) -Opfp having a TMSE ester on the 
side chain carboxyl group of glutamic acid is prepared 
as described above to prepare FMOC-,TMSE- and pfp 
30 protected aspartic acid, except that FMOC-Glu-O-tert- 
butyl (Bachem) is used in place of the aspartic acid 
precursor, to form FMOC-Glu (TMSE) -Opfp . 
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retained and admixed with 2 equivalents of Iodine in 
tetrahydrofuran/ water, 9:1, (available from Glen 
Research) under inert atmosphere and maintained at 
room temperature for 10 minutes to form oxidized 
5 solid-support coupled linker. 

One equivalent of oxidized solid support- 
coupled linker was then admixed with 20 equivalents of 
acetonitrile/acetic anhydride, 88:12, (capping 
reagent; Glen Research) for 10 minutes at room 
temperature to cap any unreacted free hydroxyls 
present on the solid support and form capped teflon- 
solid support-coupled linker. 
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b - Controlled Pore Glass (CP G) Support 

A controlled pore glass (CPG) support and 
linker is particularly preferred that can also be used 
in the present invention. The complete structure of 
an exemplary CPG support is shown in Figure 3, and 
includes (1) a sarcosine linker moiety connecting the 
linker to the CPG support, (2) a succinyl-aminohexanol 
linker that is cleavable by aqueous ammonia to release 
the polypeptide-linker-oligonucleotide conjugate (A-B- 
C) from the solid support, and (3) a serine branch- 
monomer having a photolabile cleavage site for 
releasing the oligonucleotide from the solid support. 
The synthesis is carried out stepwise as described 
below. 

N-Faoc-amidohexan-i-ol (i) was first prepared to 
form the aminohexanol linker moiety. To that end, 
30 6-Amino-l-hexanol (0.75 g, 6.4 mmol) was dissolved in 

sat. aq. Na 2 C0 3 (10 ml) and cooled on ice. 
9-Pluorenylmethyl chloroformate (Pmoc-Cl, 1.83 g, 7.1 
ranol) i n THF (25 ml, freshly distilled) was added 
slowly under vigorous stirring. The solution is 
acidified with 10% (w/v) citric acid, extracted with 
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added with stirring at room temperature under an inert 
atmosphere (Ar) . The reaction mixture was stirred 
over night, evaporated to dryness, redissolved in 
chloroform (4 x 25 ml) and extracted with sat. aq. 
5 NaHC0 3 (50 ml) . The aq. phase was back-extracted once 
with chloroform (25 ml) and the combined organic 
phases were dried from Na 2 S0 4 , filtered, and 
evaporated to yield the crude product as a brownish 
oil. The oil was dissolved in chloroform/ ethylacetate 

10 and triturated once with hexane and then purified by 

silica gel column chromatography (150 ml silica) using 
CHCl 3 /MeOH/DIPEA, 94/5/1, as the mobile phase. The 
fractions were analyzed by tic and the appropriate 
fractions were pooled and evaporated to give a white 

15 foam designated compound 3. The yield of compound 3 
was 4.29 g, and the 

MS (EI, m/e) was 630 (calcd for C^jNC^ + H 630). 

Controlled pore glass (CPG) was activated for . 
coupling to the above prepared linkers. To that end, 

20- CPG (Sigma G5019, aminopropyl-CPG, 4.97 g) was 

suspended in trichloroacetic acid in dichloromethane 
(3%, 20 ml) and agitated in a shaker for 4 hours. The 
CPG was isolated by filtration, washed three times 
with dichloromethane (20 ml) , three times with DIPEA 

25 in chloroform (10%, 20 ml) and three times with 

diethyl ether to form activated CPG. The activated 
CPG was dried in vacuo. 

Bi-functional CPG [bf-CPG, 
CPG-8ar-suc-aho-8er (O-DMT) (tf-Fmoc) ] was then prepared 

30 comprising all of the element shown in Figure 3. 

i. Activated CPG (2.0 g, loading 83 
jimole/g) was placed in a filter funnel (25 ml). 
Fmoc-Sar (313.5 mg, mmol) was dissolved in DMF (2.8 
ml), mixed with pyBOP (525.9 mg, mmol) in DMF (2.8 ml). 

35 and the combined, activated amino acid mixture was 
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to light of about 350 run and which upon cleavage 
results in the release of soluble polypeptide. 
Cleavage can be regulated by the time and intensity of 
irradiation using, for example, a Rayonet RPR Reactor 
5 as described by Barany et al. , supra . 

4. Coupling of Nucleotide to Solid Support 
a. Deprotect DMT 

The capped teflon solid support-coupled 

0 linker prepared in Example 3 was admixed with 3% DCA 
in DCM for 10 minutes at room temperature under inert 
atmospheres to remove the DMT protecting group from 
the linker and form a free hydroxyl group. The 
deprotected linker/ support was then washed 3 times 

5 with DCM. The deprotected linker /support is ready for 
addition of nucleotide. 



b. Addition of Nucleotide to Teflon Support 

One equivalent of deprotected linker/ support 

20- was admixed with about 20 equivalents of a desired 

blocked nucleotide phosphoramidite and 20 equivalents 
of tetrazole (0.45M in acetonitrile) to form a coupled 
nucleotide/ linker/ support (coupled nucleotide 
complex) • The coupled nucleotide complex was then 

25 washed with an excess of acetonitrile to remove 
unreacted reagents. All blocked nucleotide 
phosphoramidites were obtained from Glen Research and 
contain a DMT protected blocked 5' hydroxyl, a 
cyanoethyl ester (CNE) and a diisopropylamine group at 

30 - the 3 '-phosphoramidite. In addition, the adenine and 
cytosine derivatives contained a benzoyl group on the 
base f s free nitrogen and the guanosine derivative 
contains an isobutyl group on the 2 -amino group of the 
purine base. 

35 One equivalent of coupled nucleotide complex 
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was then admixed with 2 equivalents of Iodine in 
tetrahydrofuran/water, 9:1, for oxidation as before in 
Example 3 to oxidize the coupled nucleotide complex. 
Thereafter, one equivalent of oxidized 
5 nucleotide complex was admixed with 20 equivalents of 

capping reagent as before in Example 3 to cap any 
unreacted free hydroxyls and to form capped solid 
phase-coupled nucleotide complex* 

10 c. Coupling of Nucleotide to Bifun ctional CPG 

As a demonstration of coupling chemistry for 
addition of a nucleotide residue to the bifunctional 
CPG support, we have synthesized and coupled an 
oligodeoxyribonucleotide onto bf-CPG. To that end, 
15 oligonucleotides were synthesized on a ABI 394 DNA 

synthesizer using the standard 1 jraole scale synthesis 
cycle and commercially available reagents and 
phosphoramidites . The solid support used was the 
bf-CPG (20 mg, 1 Mmol) placed in a commercially 
available empty synthesis column for the ABI 
synthesizers. Repetitive yield were calculated to 
98.9% as judged from collecting the detritylation, 
deluting in p-toluene sulfonic acid monohydrate in 
acetonitril (0.1 M) and measuring the absorptions at 
25 498 nm. 

The oligonucleotide was released from the support 
by cone, aq. ammonia (shaking for more than 24 hours) . 
The oligonucleotide was isolated using commercially 
available OPC-cartridges and analyzed by HPLC and PAGE 
3 0 (radiolabeled with 32 P-g-ATP and T4-kinase) . 

5 - Coupling of Amino Acid to Solid Support 
a. Deprotect Nucleotide Comp lex 

One equivalent of capped teflon solid phase- 
35 coupled nucleotide complex was admixed with 1 



20 
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_ device. Reagents used were the 



10 



commercially available Fmoc amino acid derivatives 
either as O-pfp esters (for Val and His[Fmoc]) or the 
free acid acivated with pyBOP (for Phe, Gin and Pro). 
Coupling was conducted in the automated peptide 

synthesizer using conventional 

chemistry. The success of the coupling reaction was 
monitored by a positive and a negative Kaiser test 
after the Fmoc deprotection and the coupling steps, ' 
respectively. 

The peptide was deprotected and released from an 
aliquot of the bf-CPG by treatment with i. piperidine 
in DMF (2/8, 2 x 5 minutes) ii. cone, aq. ammonia 
(shaking for 24 hours) . The peptide was analyzed by 
15 reversed phase HPLC as one major peak. MS (Ion Spray, 

m/e) 813 (calcd for C 39 H 60 N 10 O 9 + H 813). 

6. Elongation of the Conjugate 

The conjugate can be lengthened by alternating 
20 cycles of addition of nucleotides and amino acids. The 
following alternating cycles are repeated until the 
conjugate has desired length amino acid polymer and 
oligonucleotide polymer. 



a. Addition of Nucleotides 

To couple an additional nucleotide, the 5'- 
OH on the terminal nucleotide is deprotected with DCA 
following the protocol described previously in Example 
4A for the deprotection of the teflon linker/ support. 
Thereafter, a protected nucleotide is added as 
described in Example 4B. 

b - Addition of Amino Acids 

To couple an additional amino acid, the 
amino-FMOC terminus of the last amino acid added to 
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(0*1 M) .and measuring the absorptions at 498 nm. 

- The 45-mer oligonucleotide above has two flanking 
15-mer regions that provide a site for PCR primers, 
and a middle 15-mer that is the coding sequence for 
5 the peptide using (arbitrarily) the commaless codons 

CTA for F, ATC for H, ACC for P, ACA for Q and GCG for 
V. 

The peptide-oligonucleotide conjugates formed 
were realesed from the support by cone, aq. ammonia 
10 (shaking for more than 24 hours) . The 

peptide-oligonucleotide conjugates were isolated using 
commercially available OPOcartridges and analyzed by 
HPLC and PAGE (radiolabeled with 32 P-g-ATP and 
T4 -kinase) . 

15 

Synthes is of a Peptide-Oligonucleotide 
Conjugate Lib rary on a bf-CPG Support 
Using a bifunctional (bf ) CPG support as 
described in Example 3B, a peptide-oligonucleotide 
20 conjugate library is constructed as described below in 
the following steps: 

Step l. The solid support, bf-CPG, is placed in 
a commercially available empty synthesis column for 
use in an ABI 394 DNA synthesizer (Applied Biosystems, 
25 Inc., Foster City, CA) . The 3»- oligonucleotide 

primer sequence (SEQ ID NO ) is synthesized and 

then chemically attached to the bf-CPG support 
according to standard manufacturer's procedures using 
the standard synthesis cycles and commercially 
30 available reagents and phosphoramidites . 

Step 2. The support after Step 2 is divided into 
a number of aliquots representing the number of 
different monomer amino acids to be added at library 
position 1 of the polypeptides to be synthesized. 
35 Each Fmoc-amino acid residue Monomer (1) is coupled 
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CPG conjugate. 

Step 4. The different Monomer (1) -functionalized 
bf-CPG's are pooled, mixed and divided into a number 
of aliquots representing the number of different amino 
5 acid residue monomers to be added in the library at 
position 2. Monomer (2) is attached to the CPG as 
described under Step 2 for Monomer (1) . 

Step 5. The oligonucleotide sequence that is the 
unit identifier for the appropriate Monomer (2) is 

10 attached to each of the Monomer (1) -functionalized 
bf-CPG's by placing each of the different 
Monomer (l) -functionalized bf-CPG's in separate 
commercially available empty synthesis columns for the 
DNA synthesizers. 

15 These steps, Steps 4-5, are repeated until the 

appropriate number of monomers and corresponding unit 
identifier oligonucleotides have been incorporated. 

Step 6. The combined solid supports are placed 
in a commercially available empty synthesis column for 

20 the DNA synthesizers. The 5' -oligonucleotide pririer 
sequence is chemically attached to the 
Monomer (n) -Monomer (1) -functionalized bf -CPG as 
described previously using the standard synthesis 
cycles and commercially available reagents and 

25 phosphoramidites. The final DMT-group is left 

attached to the oligonucleotide-peptide conjugates to 
be used as an affinity tag in the purification step. 

Step 7. The combinatorial 
peptide-oligonucleotide library is deprotected and 

30 released from the support by treatment first with TBAF 
for the appropriate time, and then with cone, aq. 
ammonia (shaking for more than 24 hours) . The 
peptide-oligonucleotide conjugates were isolated and 
purified using commercially available OPC-cartridges 

35 
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ii. Removal of the Bz Group 

The conjugate is admixed with an excess 
of aqueous ammonia and maintained at 60 °C overnight 
under inert atmosphere to remove the benzyl (Bz) group 
5 protecting the side chain amino group of lysine. 

iii. Removal of the MTr . Bum and Tpm Groups 
The conjugate is admixed with 20 to 50 

percent TFA and maintained at room temperature for 
10 about 5 minutes under inert atmosphere to remove the 
MTr, Bum or Tpm groups protecting the side chains of 
arginine, histidine or cysteine, respectively. 
Thereafter, the conjugate is neutralized with 
triethanolamine and CHC1 3 . 

15 

iv. Removal of the Formvl Group 

The conjugate is admixed with aqueous 
buffer at pH 12 and maintained at room temperature for 
about 5 minutes under inert atmosphere to remove the 
20^ formyl group protecting the reactive 2-amino group of 
tryptophan. 

8. Cleavage of Conjugate from Solid Support 
a. Cleavage of the Teflon Support 

25 After the protecting groups are removed from 

the conjugate on the teflon support, the bifunctional 
molecule is removed from the solid support by admixing 
the conjugate with a cleaving solution of 100 mM 
sodium periodate, 100 mM sodium phosphate buffer, pH 

30 7.2, in acetonitrile /water (1:4 v/v) . The admixture 
is maintained with agitation at room temperature with 
exclusion of light. After 4 hours of agitation, the 
liquid phase removed and the solid support is washed 
with excess water and methanol. The wash solutions 

35 are then removed and 1 umole of solid support are 
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9. Prepara tion of a Library of Bifunctional 
Molecules 

Using the synthetic procedures of Examples 1-8, 
the methods for producing a bifunctional molecule are 
detailed. To form a library of molecules, additional 
manipulations are required. First, the synthesis is 
conducted including the steps of aliquoting, adding 
different units to each aliquot, and pooling the 
aliquots to sequentially build the library. Second, 
if desired, the PGR primer binding sites and the unit 
identifier oligonucleotides can be added as 
presynthesized blocks rather than added nucleotide by 
nucleotide . 

a. Synthesis of Prote cted Oligonucleotides 
Using this procedure, PCR primer 
binding site oligonucleotides Pi and P2 were 
synthesized having the nucleotide sequences shown in 
Table 2, but having an DMT at the oligonucleotide's 5' 
terminus, and having a CNE ester and an amino 
diisopropyl phosphoramidate at the oligonucleotide's 
3 1 terminus. Similarly, unit identifier 
oligonucleotides were synthesized for glycine (Gly) 
and methionine (Met) having 6 nucleotides per unit and 
having the blocked termini described above. The unit 
identifier oligonucleotide sequences are shown in 
Table 2. 



TABLE 2 



30 



Designation 



Oligonucleotide Sequence 
5 ' -GGGCCCTATTCTTAG-3 ' 
5 1 -AGCTACTTCCCAAGG-3 » 



PI 




5 1 -ACGGTA-3 • 



5 1 -CTCATG-3 • 



35 
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ACGGTACTCATG-Pl-LINK-gly . met , and 
ACGGTAACGGTA-Pl-LINK-met . met . 
The two aliquots are then pooled to form a mixture of 
the four bifunctional molecules. 
5 In Step 3 # the pool from Step 2 is divided into 

two aliquots. The first aliquot is subjected to a 
sequential coupling as before, adding glycine and the 
oligonucleotide Z 9ly . Thereafter, protected 
oligonucleotide P2 is coupled to the growing 
10 bifunctional molecules in the pool as described for a 
single protected nucleotide in Example 4 to form the 
structures : 

P2CTCATGCTCATGCTCATGPl-LINK-gly . gly . gly , 
P2CTCATGCTCATGACGGTAPl-LINK-met . gly . gly , 
15 P2CTCATGACGGTACTCATGPl-LINK-gly . met . gly , and 

P2CTCATGACGGTAACGGTAPl-LINK-met . met . gly . 
The second aliquot is subjected to a sequential 
coupling as before, adding methionine and the 
oligonucleotide Z"**. Thereafter , protected 
20 oligonucleotide P2 is coupled to the growing 

bifunctional molecules in the pool as described for a 
single protected nucleotide in Example 4 to form the 
structures : 

P2ACGGTACTCATGCTCATGPl-LINK-gly.gly.met , 
25 P2 ACGGTACTCATGACGGTAPl-LINK-met . gly . met , 

P2 ACGGTAACGGTACTCATGPl-LINK-gly . met • met , and 
P2 ACGGTAACGGTAACGGTAPl-LINK-met . met . met . 
The two aliquots are then pooled to form a mixture of 
the eight bifunctional molecules. 
30 The resulting pool of eight different 

bifunctional molecules represents a small library 
produced according to the methods of this invention. 
By increasing the alphabet size one increases the 
number of aliquots per step. 

35 
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What Is Claimed Is: 

1. A bifunctional molecule according to the 
formula A-B-C, wherein A is a chemicals moiety, B is a 
linker molecule operatively linked to A and C, and C 

5 is an identifier oligonucleotide comprising a sequence 

of nucleotides which sequence identifies the structure 
of chemical moiety A. 

2. The bifunctional molecule of claim 1 wherein 
A is a polymer comprising a linear series of chemical 

10 units represented by the formula (X n ) a , wherein X is a 

single chemical unit in polymer A; and 
identifier oligonucleotide C is represented by the 
formula (Z n ) a , wherein Z is a unit identifier 
nucleotide sequence within oligonucleotide C that 

15 identifies the chemical unit X at position n; and 

wherein 

n is a position identifier for both X in polymer 
A and Z in oligonucleotide C having the value of l+i. 
where i is an integer from 0 to 10, such that when n 
20 is 1, X or Z is located most proximal to the linker, 

and a is an integer from 4 to 50. 

3. The bifunctional molecule of claim 2 wherein 
said unit identifier nucleotide sequence Z has a 
length of from 2 to 8 nucleotides. 

25 4 • The bifunctional molecule of claim 1 wherein 

said chemical moiety A is an oligosaccharide, 
polypeptide, glycolipid, lipid, proteoglycan, 
glycopeptide or oligonucleotide. 

5. The bifunctional molecule of claim 2 wherein 

30 said polymer A is a polypeptide, X is an amino acid 
residue in said polypeptide, and unit identifier 
nucleotide sequence Z is a hexanucleotide sequence 
that identifies the amino acid residue at position n 
in polypeptide A. 

35 6 - The bifunctional molecule of claim 5 wherein 
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16. The library of claim 12 wherein X is an 
amino acid and said unit identifier nucleotide 
sequence Z has a length from 3 to 6 nucleotides. 

17. The library of claim 11 wherein each of said 
5 species of bifunctional molecules in said plurality is 

present in molar equivalents of from 0.2 to 10.0. 

18. The library of claim 12 wherein said 
identifier oligonucleotide C in each of said species 
of bifunctional molecules has a nucleotide sequence 

10 according to the formula Pl-(Z n ) a -P2, where PI and P2 
are nucleotide sequences that provide PCR primer 
binding sites adapted to amplify the identifier 
oligonucleotide, and where the nucleotide sequences of 
PI and P2 are shared by all bifunctional molecule 

15 species in the library. 

19. A method for identifying a chemical 
structure that participates in a preselected binding 
interaction with a biologically active molecule, said 
chemical structure being present in a library of 

20 bifunctional molecules according to claim 11, 
comprising the steps of: 

a) admixing in solution said library of 
bifunctional molecules with the biologically active 

_ molecule under binding conditions for a time period 
25 sufficient to form a binding reaction complex; 

b) isolating the complex formed in step 

(a) ; and 

c) determining the nucleotide sequence of 
the identifier oligonucleotide in the isolated complex 

30 and thereby identifying the chemical structure that 
participated in the preselected binding interaction. 

20. The method of claim 19 wherein said 
biologically active molecule is affixed to a solid 
support . 

35 21. The method of claim 19 wherein said 
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to add an additional chemical unit X and corresponding 
identifier oligonucleotide Z to the bifunctional 
molecules in the admixture, 

26. The method of claim 25 wherein said 

5 repetition of steps (c) and (d) are repeated on the 

admixture from 1 to 6 times, thereby forming a polymer 
A on said bifunctional molecules such that a is from 3 
to 10. 

27. The method of claim 19 wherein said linker 
10 molecule is a bifunctional solid support selected from 

the group consisting of bf-CPG or o-NB-bf-CPG. 

28. The method of claim 24 wherein said linker 
molecule is a bifunctional solid support selected from 
the group consisting of bf-CPG or o-NB-bf-CPG. 

15 29. A bifunctional solid support for 

synthesizing an oligopeptide/ oligonucleotide 

conjugate comprising: 

a solid support, said solid support being of 

a type which is dispersible in aqueous solution, 
20 a first linkage unit coupled to said solid 

support, 

a second linkage unit coupled to said first 
linkage unit, and 

a bifunctional unit coupled to said second 
25 linkage unit, 

said bifunctional unit having a first 
leaving group employable for oligopeptide synthesis 
and a second leaving group employable for 
oligonucleotide synthesis, 
30 said first leaving group being N-FMOC or its 

functional equivalent, 

said second leaving group being O-DMT or its 
functional equivalent, 

said second linkage unit being coupled to 
35 said first linkage unit by means of a bond cleavable 
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34. A bifunctional solid support as described in 
claim 33 wherein: 

said solid support is aminopropyl-CPG, 
said first linkage unit includes a sarcosine 
5 linker coupled to the aminopropyl-CPG and a succinyl 
linker coupled to the sarcosine linker by means of an 
amide bond, 

said second linkage unit includes an 
aminohexanol group coupled to said succinyl linker by 
10 means of an alkyl ester, 

said bifunctional unit includes an L-serine 
residue, the amino end of the serine being coupled by 
means of an amide bond to said aminohexanol linker, 
the carboxyl end of said serine being coupled to an 
15 FMOC leaving group, the hydroxy 1 end of said serine 
being coupled to an O-DMT leaving group. 

35. A bifunctional solid support as described in 
claim 29 further comprising: 

a third linkage unit interposed between and 
20 coupled to said bifunctional unit and said first 

leaving group, 

said third linkage unit being cleavable by 
exposure to ultra- violet light. 

36. A bifunctional solid support as described in 
25 claim 35 wherein said third linkage unit includes a 3- 

nitro-4-O-ethyl benzoate group coupled by means of an 
amide bond to the amino end of said serine and coupled 
by means of an ester bond to an FMOC blocked amino 
acid. 

30 37. A bifunctional solid support as described in 

claim 36 wherein: 

said solid support is aminopropyl-CPG, 
said first linkage unit includes a sarcosine 
linker coupled to the aminopropyl-CPG and a succinyl 
35 linker coupled to the sarcosine linker by means of an 
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ultraviolet light. 

41. An element of a library of 
oligopeptide/oligonucleotide conjugates comprising: 

a bifunctional unit, 
5 an oligopeptide attached to said 

bifunctional unit, and 

an oligonucleotide attached to said 
bifunctional unit. 

42. An element of a library of 

10 oligopeptide/ oligonucleotide conjugates as described 
in claim 41 further comprising a cleavable bond for 
coupling said bifunctional unit to said oligopeptide, 
said cleavable bond being cleavable by exposure to 
ultraviolet light. 

15 
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PI - LINK 

| Step 1 

CTCATG-Pl-LINK-gly 

ACGGTA-P1 -LINK-met 

| Step 2 

CTCATGCTCATG-Pl-LINK-gly . gly 
CTCATGACGGTA-Pl-LINK-met . gly 

ACGGTACTCATG-Pl-LINK-gly .met 
ACGGTAACGGTA-P1- LINK-met .met 

| Step 3 

P2 CTCATGCTCATGCTCATGPl-LINK-gly . gly . gly 
P2CT.CATGCTCATGACGGTAPl-LINK-met . gly . gly 
P2 CTCATG ACGGTACTCATGP 1 - LI NK-g 1 y . met . gly 
P2 CTCATGACGGTAACGGTAP 1 -LINK-met . met . gly 

P2ACGGTACTCATGCTCATGPl-LINK-gly . gly . met 
P2ACGGTACTCATGACGGTAP1 -LINK-met . gly ..met 
P2ACGGTAACGGTACTCATGPl-LINK-gly . met . met 
P2 ACGGTAACGGTAACGGTAPl-LINK-met . met . met 



PI = GGGCCCTATTCTTAG 
P2 « AGCTACTTCCCAAGG 



FIG. 2 
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